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Abstract — We consider the problem of transmitting classical 
and quantum information reliably over an entanglement-assisted 
quantum channel. Our main result is a capacity theorem that 
gives a three-dimensional achievable rate region. Points in the 
region are rate triples, consisting of the classical communication 
rate, the quantum communication rate, and the entanglement 
consumption rate of a particular coding scheme. The crucial 
protocol in achieving the boundary points of the capacity region is 
a protocol that we name the classically-enhanced father protocol. 
The classically-enhanced father protocol is more general than 
other protocols in the family tree of quantum Shannon theoretic 
protocols, in the sense that several previously known quantum 
protocols are now child protocols of it. The classically-enhanced 
father protocol also shows an improvement over a time-sharing 
strategy for the case of a qubit dephasing channel — this result 
justifies the need for simultaneous coding of classical and quan- 
tum information over an entanglement-assisted quantum channel. 
Our capacity theorem is of a multi-letter nature (requiring a 
limit over many uses of the channel), but it reduces to a single- 
letter characterization for at least three channels: the completely 
depolarizing channel, the quantum erasure channel, and the qubit 
dephasing channel. 

Index Terms — quantum Shannon theory, entanglement-assisted 
quantum channel, entanglement-assisted classical-quantum cod- 
ing, classically-enhanced father protocol 

I. Introduction 

The communication of information over a noisy quantum 
channel is a fundamental task in quantum communication 
theory. A sender may wish to transmit classical informa- 
tion, quantum information, or both. The Holevo-Schumacher- 
Westmoreland (HSW) coding theorem gives an achievable rate 
at which a sender can transmit classical data to a receiver if she 
transmits the classical information over a noisy quantum chan- 
nel ID, lIJl. The HSW theorem generalizes Shannon's classical 
channel coding theorem [JJ to the quantum setting. The Lloyd- 
Shor-Devetak (LSD) coding theorem gives an achievable rate 
at which a sender can transmit quantum data to a receiver 
through a quantum channel [4J, [5|, [6|. Devetak and Shor 
followed up on these results by determining achievable rates at 
which a sender can simultaneously transmit both classical and 
quantum information over a quantum channel |7|. The naive 
scheme is to employ a time-sharing strategy, where a sender 
uses an HSW code for a fraction of the transmitted qubits 
and an LSD code for the other fraction. The Devetak-Shor 
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coding strategy outperforms the naive time-sharing strategy, at 
least when the noisy channel is the qubit dephasing channel 
|8|. This result demonstrates the need to consider non-trivial 
coding schemes when communicating more than one resource. 

A sender can exploit a quantum channel alone, as in the 
above examples, or she can exploit assisting resources as well. 
Examples of such assisting resources are a static resource 
shared with the receiver, as in the case of common ran- 
domness, secret key, or entanglement, or a dynamic resource 
connecting the sender to the receiver, as in the case of a 
noiseless classical or quantum side channel. 

Assisting a quantum channel with noiseless resources some- 
times improves communication rates. The simplest and most 
striking example of this phenomenon occurs when a noise- 
less ebit assists a noiseless qubit channel. The super-dense 
coding protocol outlines a simple method to transmit two 
classical bits over a noiseless qubit channel assisted by an 
ebit 19|. This protocol beats the Holevo bound |8|, which 
limits an unassisted noiseless qubit channel to transmit no 
more than one classical bit. The super-dense coding protocol 
then led Bennett et al. to explore if one could improve the 
classical capacity of a noisy quantum channel by assisting 
it with unlimited entanglement lITOll . They confirmed 

their intuition by proving a channel coding theorem that gives 
an entanglement-assisted classical transmission rate higher 
than that without assistance. Shor then refined this result by 
determining trade-offs between the classical communication 
rate and the entanglement consumption rate lfT2l . 

Quantum information theorists have since organized pro- 
tocols that exploit the different resources of quantum com- 
munication, classical communication, and entanglement into a 
family tree |13|, QH, |[T3, |il6J. One member of the family 
tree is the father protocol lfT3l . lfT4l . The father protocol is 
so named because it generates several "child" protocols using 
the theory of resource inequalities fT3l . lfT4ll . Devetak et al. 
exploited the father protocol to demonstrate trade-offs between 
the quantum communication rate and the entanglement con- 
sumption rate over an entanglement-assisted quantum channel 
Ifl3l. 

An important natural question, in light of the aforemen- 
tioned trade-off solutions for two of the three noiseless re- 
sources, is then how one might combine all three different 
resources. Previous work has addressed trade-offs for the task 
of remotely preparing quantum states with the aid of classical 
communication, quantum communication, and entanglement 
flTl, but no one has yet considered the triple trade-offs for 
channel coding. 

In this article, we conduct an investigation of the trade-offs 
for channel coding both quantum and classical information 
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over a quantum channel assisted by noiseless entanglement. 
We prove the entanglement-assisted classical and quantum 
capacity theorem, that gives achievable rates for this task. 
We extend the family tree of quantum Shannon theory by 
developing the classically-enhanced father protocol^ This 
protocol is more general than any of the existing protocols in 
the tree and achieves rates in the three-dimensional capacity 
region. We dub this protocol the "classically-enhanced father 
protocol" because it is an extension of the father protocol, 
and it generates five child protocols in the sense of Refs. ifTSl . 
llT4l . Two of its child protocols are classically-enhanced quan- 
tum communication [7| and entanglement-assisted classical 
communication ifTOl , ifTTI . lfT2l (we detail the others in Sec- 
tion VI-Fi. We also demonstrate that isometric encodings are 



sufficient for achieving our rate formulas, resolving an open 
problem from Ref. Ifl4l . 

A benefit of the classically-enhanced father protocol is that 
it inspires the design of classically-enhanced entanglement- 
assisted quantum error-correcting codes |fT9l , ll20l . We give 



evidence in Section VIII-B that it is possible to reach the 



achievable rates without encoding classical information into 
the entanglement shared between the sender and receiver. 

We structure this article as follows. In the next section, 
we give some definitions and establish notation used in the 



remainder of the article. Section III provides a description of 
a general protocol for communication of classical and quantum 
information with the assistance of entanglement. We then state 
the main capacity theorem. Theorem [T] in Section IV and 
show how the classical capacity theorem |2|, the quan- 
tum capacity theorem Q, |6|, the classically-enhanced 
quantum capacity region JT), the father capacity region lfT4l . 
and the entanglement-assisted classical capacity region [03 
are all special cases of the entanglement-assisted classical and 
quantum capacity region. We prove the converse of Theorem [T] 
in Section IV] and prove the direct-coding part of Theorem U\ 
in Section [VT] Section |VI-F| discusses the child protocols 
that the classically-enhanced father protocol generates. We 
then give three example channels, the completely depolarizing 
channel, the quantum erasure channel, and the qubit dephasing 
channel, that admit a single-letter solution for the capacity 
region (meaning that we have a complete understanding of 
the capacity region for these channels). We also show that 
the classically-enhanced father protocol gives an improvement 
over a time-sharing strategy when the noisy channel is the 
qubit dephasing channel. We end by summarizing our results 
and by posing several open questions. 

II. Definitions and Notation 

The ensemble {p (x) , ''Px^^} ^^^^ where each state 
is a pure tripartite state, is essential in the ensuing analysis of 
this article. The coherent information I {A)B)^ of each state 
ipx^^ in the ensemble is as follows: 



I[A)B] 



H{B)^^-H{AB) 



'As a side note, we mention tliat former articles discuss tlie possibility of 
this protocol but never fully developed it |18|, |14|. In addition, the current 
authors have both constructed "classically-enhanced father" error-correcting 
coding schemes for block codes 1 19| and for convolutional codes 1201 . 



where H {B)^ is the von Neumann entropy of the reduction 
of the state i^^'^^ to the system B with a similar definition 
for H (AB)^ . The quantum mutual information I [A] B)^ 
of each state is as follows: 



I{A;B: 



HiA)^^+IiA)B)^ 



We can classically correlate states in some system X with 
each state ip^^^ to produce an augmented ensemble 

L J xex 

where the set {la;)}^^;^^ is some preferred orthonormal basis 
for the auxiliary system X. The expected density operator of 
this augmented ensemble is the following classical-quantum 
state: 

a' 



xex 

The Holevo information of the classical variable X with the 
quantum system B is I {X; B)^. For the special case of a 
classical system X, taking the expectation of the above en- 
tropic quantities with respect to the density p {x) gives the re- 
spective conditional entropy H {A\X)^, conditional coherent 
information I {A)B\X)^, and conditional mutual information 
I{A-B\X)^: 



nA)B\X)^ 
nA;B\X)^ 



x£X 

E 

xi^X 

E 

xex 



p{x)I{A)B)^^, 
p{x)I{A-B)^^. 



One can easily prove that / {A)B\X)^ ^ I {A)BX)^. We use 
the notation I {A)BX)^ for conditional coherent information 
in what follows. The above definitions lead to the following 
useful identities: 

H (A\X)^ - h {A- B\X)^ + h {A; E\X)^ , (1) 

/ {A)BX)^ = \l {A- B\X)^ - \l [A; E\X)^ . (2) 

Proving the above identities is a simple matter of noting that 
the von Neumann entropy is equal for the reduced systems 
of a pure bipartite state. Adding the above identities gives the 
following one: 



H {A\X)^ + / {A)BX)^ = I [A- B\X)^ 



(3) 



The chain rule for quantum mutual information proves to be 
useful as well: 



I{AX- B), - I{A; B\X), + I{X; B),. 



(4) 



All of the above information quantities possess operational 
interpretations in the theorems in this article. 



A noisy quantum channel H 



A'^B 



acts as a completely- 



positive trace-preserving (CPTP) map. It takes a quantum 
system A' as an input and produces a noisy output quantum 
system B. 

A conditional quantum encoder ^ conditional 

quantum channel 121], is a collection [S^-^^] ^ of CPTP 
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maps. Its inputs are a classical system M and a quantum 
system A and its output is a quantum system B. A classical- 



quantum state p 
P 



MA 



, where 



M 



can act as an input to the conditional quantum encoder 
gMA^B^ The action of the conditional quantum encoder 
gMA^B jjjg classical-quantum state p*^"^ is as follows: 

^MA^B ^pMA^ 



■ Ttj^j < (m) |m) (m|*^ (g) £. 

\ m. 




It is actually possible to write any quantum channel as a 
conditional quantum encoder when its input is a classical- 
quantum state ETI . In this article, a conditional quantum 
encoder functions as the sender Alice's encoder of classical 
and quantum information. 

A quantum instrument X)^~^^^^ is a CPTP map whose input 
is a quantum system A and whose outputs are a quantum 
system B and a classical system M Iil4il . |2jJ. A collection 
{T^m~^^}m completely-positive trace-reducing maps spec- 
ifies the instrument j)^^^^ . The action of the instrument 
jyA^BM arbitrary input state p is as follows: 



V 



A~¥BM 



(5) 



Tracing out the classical register M gives the induced quantum 
operation T>^~^^ where 



A^B 



^A\ _ 



[P 



m 

This sum map is trace preserving: 



A^B 



,A^B / A) 



= 1. 



Tr J^P; 

\, m ) 

We can think of the following quantity 

pHp^)^Tr{2?,r^ (p^)}, 

as a conditional probability p (mjp'^) of receiving the classical 
message m when the state p^ is input. In this article, a 
quantum instrument functions as Bob's decoder of classical 
and quantum information. 

We abbreviate a capacity region by the noiseless resources 
involved: classical communication (C), quantum communica- 
tion (Q), or entanglement (E), but we abbreviate a protocol 
with a different name corresponding either to its inventors 
or an appropriate acronym. For example, we speak of the 
C, Q, or CE capacity theorems for classical communication, 
quantum communication, and entanglement-assisted classical 
communication, respectively, but the corresponding protocols 
are Holevo-Schumac her- Westmoreland coding (HSW), Lloyd- 
Shor-Devetak coding (LSD), and entanglement-assisted clas- 
sical coding (EAC). 

We note some other points before beginning. The trace norm 
ll^llj^ of an operator A is as follows: 



lAlli =Tr{\/Iu| 



The maximally entangled state on system Ta and Tb is 
(^TaTb 'pjjg omission of a superscript implies a reduced state, 
e.g., the state <i>^^ is the reduced state of ^^-i^^^ on Ta- Yard's 
thesis ll2n provides a good introduction to quantum Shannon 
theory, and we point the reader there for properties such as 
strong subadditivity V12i and the quantum data processing 
inequality 



III. A General Protocol for 
Entanglement-Assisted Communication of 
Classical and Quantum Information 

We begin by defining a general protocol for entanglement- 
assisted communication of classical and quantum information 
(EACQ) for a noisy quantum channel connecting a sender 
Alice to a receiver Bob. Alice would like to communicate 
two items to Bob: 

1) An arbitrary quantum state p"^^ in a system Ai with 
dimension 2"*^. 

2) One of 2"^ classical messages. 

Alice and Bob also share entanglement in the form of a 
maximally entangled state $^^^^3 prior to communication. 
Alice possesses the system Ta, Bob possesses the system Tb, 
and the dimension of each system is 2"^. We can think of 
this state as possessing nE ebits of entanglement because it is 
equivalent by local isometries to nE "gold standard" ebits in 
the state \^+)AB ^ (jOO)'^'^ + |ll)'^^)/\/2. Alice performs a 
conditional quantum encoder £MAiTa~^a jj^^j. gjjQodes both 
her quantum systems Ai and Ta and the classical message in 
system M. The encoding operation ^^'^^iTa^a pj-gpares a 
system A'" for input to a noisy quantum channel "-*-b" 
The channel "^b" j-gpresents n independent uses of the 
noisy quantum channel 



A'" 



She then sends her state through the quantum channel 
j^A -i-B g^i^ receives the system B" and performs a 
decoding instrument T>^ Tb^BiBeM channel output 

and his half of the entanglement Tb- The instrument 
j^b^Tb^BiBeM produces a system Bi with the quantum 
information that Alice sent, a classical register M containing 
Alice's classical message, and another system Be that does not 
contain any useful information. Bob should be able to identify 
the classical message with high probability and recover the 
state p"^i with high fidelity. Figure [l] provides a detailed 
illustration of this protocol. 

It is useful to consider the isometric extension -^be 
of the channel where Alice controls the channel 

input system A' , Bob has access to the channel output system 
B, and the environment Eve has access to the system 
For an independent and identically distributed (IID) channel 
j^A "^-B" defined above, we write its isometric extension 
as ^ - Also, it is useful to think of Alice's quantum 

system p^^ as a restriction of some pure state Lp^^-i where 
Alice does not have access to the purification system R- 

-It should be clear from context when E refers to Eve's system or when 
it refers to the entanglement consumption rate. 
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Fig. 1. (Color online) The above figure depicts a general EACQ protocol. A 
sender Alice would like to communicate the quantum information in system 
Ai and the classical information in system M. Her system represents 
shared maximal entanglement with the receiver's system Tg. Alice encodes 
her information and uses the noisy channel a large number of times. The 
environment Eve obtains part of the output and the receiver Bob obtains 
the other part. Bob combines his received systems with his half of the 
entanglement and performs a decoding operation to recover both the classical 
and quantum information. 



We formalize the EACQ quantum information processing 
task as follows. Define an {n, C\ Q, E, e) EACQ code by 
• Alice's conditional quantum encoder ^a/AiTa-s-a .^^j^j^ 
encoding maps {£.^^^^^^^ }rne[2"C]. This encoder en- 
codes both her quantum information and classical in- 
formation. Define the following states for each classical 
message ni: 



RA'"Tb = gAiTA^A'" 



RAi 



(6) 



where the dimension of system Ai is 2"'^ and the 
dimension of system Ta is 2"^. The density operator 
that includes the classical register AI and averages over 
all classical messages is as follows: 



,MRA'"Tb 



where \M\ is the size of the classical register M. The 
output of the channel given that Alice sent classical 
message ni is then as follows; 



The average output of the channel is as follows: 



MRB'^E"Tb 



ttA'"^B"E" 



MRA'"Te 



(8) 



Bob's decoding instrument 2?-B"Tb^SiBeM^ whose ac- 
tion is defined in (jSj), is a collection of completely- 
positive trace-reducing maps {P,^ "^^^■^^^^}me[2"C]. 
The decoding instrument decodes both the quantum in- 
formation and classical information that Alice sends. The 
density operator corresponding to Bob's output state is as 
follows: 

^ AIRBiBeME"- _ j^B^Tb^BiBeM /',,,MJ?.B"£;"Ti 



The classical probabiUty of successful transmission of mes- 
sage TO is as follows: 

Pr{M = TO I M - to} = Tr{(2?,f;^«^^i^^)(w^^"^"^«)}, 

where M denotes the random variable corresponding to Bob's 
received classical message. The final state on the reference 
system R and Bob's quantum system Bi is T^^^ where 



T 



RBi 



Tr - 



B'^Tb-^BiBeM RB'^E'^Tb 



')}■ 



For the (n, C, Q, E, e) EACQ code to be "e-good," the 
following two conditions should hold for all classical messages 

TO e [2"^]: 

1) Bob decodes any of the classical messages to with high 
probability 



Pr{M = TO I A/ = to} > 1 - e, 



(9) 



2) The state T^^^ should be e-close to the ideal state 



-^RBi 



RBi 



(10) 



so that Bob recovers the quantum information in system 

Ai with high fidelity. 
A rate triple {C,Q,E) is achievable if there exists an 
{n,C ~d,Q-S,E + S,e) EACQ code for any e,S > 
and sufficiently large n. The capacity region C{J\f) is a 
three-dimensional region containing all achievable rate triples 
iC\Q,E). 

IV. The Entanglement-Assisted Classical and 
Quantum Capacity Theorem 

We now state our main theorem: the entanglement-assisted 
classical and quantum capacity (CQE) theorem that involves 
all three noiseless resources. 

Theorem 1: The capacity region Ccqe{-^) of an 
entanglement-assisted quantum channel Af for simultaneously 
transmitting both quantum information and classical 
information is equal to the following expression: 



Ccqe(A/') = U ^C^eIA^' 



fc=i 



(11) 



where the overbar indicates the closure of a set. The "one- 
shot" region C^Qg(A/') is the union of the regions C^Qg a-(^)- 



where C^Qj,„{Af) is the set of all C, Q, > 0, such that 

C + 2Q <I{AX;B),, (12) 
Q<I{A)BX), + E, (13) 
C + Q <I{X;B), + I{A)BX), + E. (14) 

The above entropic quantities are with respect to a "one-shot" 
quantum state ct^^be ^jjgj-g 



xabe _ 



tA'^BE ( lAA' 



(15) 



5 




Fig. 2. (Color online) An example of the one-shot, one-state achievable region Cj,Qg^ (A^) corresponding to a state q-^^be jj^^j gj-jsgj from a qubit 
dephasing channel with dephasing parameter p = 0.2. The state input to the channel J\f is a-^^^ , defined in jl6| . The plot features seven achievable corner 
points of the one-shot, one-state region. We can achieve the convex hull of these eight points by time-sharing any two different coding strategies. We can 
also achieve any point above an achievable point by consuming more entanglement than necessary. The seven achievable points correspond to the father 
protocol (EAQ) 1131 . 1141 , the Devetak-Shor protocol for classically-enhanced quantum communication (CEQ) |7|, Shor's protocol for entanglement-assisted 
classical communication with limited entanglement (EAC) |12|, quantum communication (LSD) |4|, |5|, |6|, combining CEF with entanglement distribution 
and super-dense coding (CEF-SD-ED) as detailed in Section |VI-F| the classically-enhanced father protocol (CEF) outlined in Section |VI| and combining 
the classically-enhanced father protocol with teleportation |24| (CEF-TP). Observe that we can obtain EAC by combining CEF with super-dense coding as 
detailed in Section [vFf] so that the points CEQ, CEF, EAC, and CEF-SD-ED all lie in plane III. Observe that we can obtain CEQ from CEF by entanglement 
distribution and we can obtain LSD from EAQ and EAQ from CEF-TP, both by entanglement distribution. Thus, the points CEF, CEQ, LSD, EAQ, and 
CEF-TP all lie in plane II. Finally, observe that we can obtain all corner points by combining CEF with the unit protocols in (6T163}. This one-shot, one-state 
achievable region for the state o-^^^SB tight. The bounds in jl2|l4| uniquely specify the respective planes I-III. We obtain the full achievable region by 
taking the union over all states a of the one-shot, one-state regions {Af) and taking the regularization, as outlined in Theorem^ The above region is a 
translation of the unit resource capacity region to the classically-enhanced father protocol. 



the States (j)^^' 
niin { I A' 



are pure, and it is sufficient to consider 
B\f + lhy the method in Ref. f25\. 



X\ < 



The capacity region in Theorem [T is a union of general 



I2p4 



polyhedra, each specified by 
over all possible states of the form 
infinite number of uses of the channel 



where the union is 



15 1 and a potentially 
Figure [2] illustrates an 
example of the general polyhedron specified by (12p4| , where 
the channel is the qubit dephasing channerl with dephasing 
parameter p = 0.2, and the input state is 



a^^^'^lm (or 



where 



\<f>if^' = v/3/4|00)"" + v/l/4|ll) 

resulting from the channel is 

;re U^f i 

of the qubit dephasing channel. The figure caption provides a 



,AA' 



yAA' 



yAA' 



(16) 



XABE 



The state a 

U^^^^{a'^^^) where Uj\f is an isometric extension 



The action of the qubit dephasing channel with dephasing parameter p on 
a density operator pisp— — p)p + pZpZ. 



detailed explanation of the one-shot, one-state region C^gg 
(note that Figure |2] displays the one-shot, one-state region 
and does not display the full capacity region). 

The above capacity region has the simple interpretation. 
In Ref [26], we determined a unit resource capacity region. 
This unit resource region outlines what is achievable if one 
does not possess a noisy channel, but only possesses the 
three noiseless resources of classical communication, quantum 
communication, and entanglement. There, we found that the 
optimal strategy is to combine teleportation, super-dense cod- 
ing, and entanglement distribution. Interestingly, the above set 
of inequalities demonstrates that the one-shot, one-state region 
is a translation of the unit resource capacity region to the 
classically-enhanced father protocol. Indeed, eliminating the 
entropic quantities from ( [T2p4| reveals that the inequalities 
are the same as those that specify the unit resource capacity 
region. 

Proving that Theorem [T] holds consists of proving it in two 
steps, traditionally called the direct coding theorem and the 
converse. For our case, the direct coding theorem proves that 
the region corresponding to the right hand side of ( 1 1 1 is an 



achievable rate region. It constructs an EACQ protocol whose 
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rates are in the region of the right hand side of ( [TT] i and shows 
that its fidelity of quantum communication is high and its 
probability of error of classical communication is small. The 
converse assumes that a good code with high fidelity and low 
probability of error exists and shows that the region on the 



right hand side of ( 1 1 1 bounds the achievable rate region. We 
prove the converse in Section |V] and the direct coding theorem 
in Section IVll 



First, we rewrite the coherent information I{A)BX)^ as 
H{B\X)^ - H{E\X)^. Then I{X-B)„ + I{A)BX)^ = 
H{B)^ ~ H{E\X)^. Next, pure states of the form \lp)^ 
are sufficient to attain the classical capacity of a quantum 
channel |12|. We briefly recall this argument. An ensemble of 
the following form realizes the classical capacity of a quantum 
channel: 



P 



^■^'^^px {x)\x) {xf®pi' 



A. Special Cases of the Capacity Theorem 

We first consider five special cases of the above capacity 
theorem that arise when Q and E both vanish, C and E both 
vanish, or one of C, Q, or E vanishes. The first two cases cor- 
respond respectively to the Holevo-Schumacher- Westmoreland 
coding theorem and the Lloyd-Shor-Devetak coding theorem. 
Each of the other special cases traces out a two-dimensional 
achievable rate region in the three-dimensional capacity re- 
gion. The five coding scenarios are as follows: 

1) Classical communication (C) when there is no entangle- 
ment assistance or quantum communication H], IJ). The 
achievable rate region lies on the (C, 0, 0) ray extending 
from the origin. 

2) Quantum communication (Q) when there is no entan- 
glement assistance or classical communication i4|, O, 
||6l. The achievable rate region lies on the (0, Q,0) ray 
extending from the origin. 

3) Entanglement-assisted quantum communication (QE) 
when there is no classical communication ||T3]l . lfT4ll . The 
achievable rate region lies in the {0,Q,E) quarter-plane 
of the three-dimensional region in ( fTT) . 

4) Classically-enhanced quantum communication (CQ) 
when there is no entanglement assistance |7 1. The achiev- 
able rate region lies in the (C, Q, 0) quarter-plane of the 
three-dimensional region in ( [TT] l. 

5) Entanglement-assisted classical communication (CE) 
when there is no quantum communication lfT2l . The 
achievable rate region lies in the (C, 0, E) quarter-plane 
of the three-dimensional region in ( [TT| . 

1) Classical Capacity: The following theorem gives the 
one-dimensional capacity region Cq {Af) of a quantum channel 
J\f for classical communication 111, 

Theorem 2: The classical capacity region Cc(A/') is given 

by 



k=l 



The "one-shot" region C^^^(A/^) is the union of the regions 



This ensemble itself is a restriction of the ensemble in ^T5\ to 
the systems X and A'. Each mixed state admits a spectral 
decomposition of the form — J2yPY\x {y\x) '4'x,y where 
ip-^y is a pure state. We can define an augmented classical- 
quantum state 9-^^^ as follows: 



so that Try {6* 

„XA' 



XYA' 



^XA' 



} = P' 

States p'^^ and 9-^^"^ leads to the respective states p'^" and 
qXYB ^ Then the following equality and inequality hold: 



Sending the A' system of the 

,XB 



I{X-B)^ = I{X-B), 

<I{XY;B),, 



r.XA' 



and 



where the equality holds because Try{0 } = P 
the inequality follows from quantum data processing. Redefin- 
ing the classical variable as the joint random variable X, Y 
reveals that it is sufficient to consider pure state ensembles for 
the classical capacity. Returning to our main argument, then 
H iE\X)^ ^ H {B\X)^ so that /(X; B), + I {A)BX)^ = 
H (S)^-iJ iB\X)^ = I{X; B)„ for states of this form. Thus, 
the expression in ( [T8] l can never exceed the classical capacity 
and finds its maximum exactly at the Holevo information. 

2) Quantum Capacity: The following theorem gives the 
one-dimensional quantum capacity region Cq{N) of a quan- 
tum channel TV |4|, |5i, fSl. 

Theorem 3: The quantum capacity region Cq(A/') is given 

by 



OO ^ 



(19) 



fe=i 



The "one-shot" region Cq^''(A/^) is the union of the regions 
(17) cj^liAf), where C^^^(A/') is the set of all Q > 0, such that 



Q<I{A)BX),. 



(20) 



Cc,o-(-^)' where C^^j^iAf) is the set of all C > 0, such that The entropic quantity is with respect to the state a 

C <I{X-B)„+I{A)BX)^. (18) 
The entropic quantity is with respect to the state <j 



XABE 



( 15 1 with the restriction that the density p{x) is degenerate. 



XABE 

The bound in ( fTSj ) is a special case of the bound in ( [T4| with 
Q — Q and E = Q. The above characterization of the classical 
capacity region may seem slightly different from the original 
HSW characterization, until we make a few observations. 



The bound in (20 1 is a special case of the bound in (13 1 
with E = Q. The other bounds in Theorem [T] are looser than 
the bound in ([T3]l when C, = 0. 

3} Entanglement-Assisted Quantum Capacity: The follow- 
ing theorem gives the two-dimensional entanglement-assisted 
quantum capacity region Cqe(A/') of a quantum channel N 

ca, ma. 
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Theorem 4: The entanglement-assisted quantum capacity 
region Cqe(A/') is given by 



Cqe(AA) = U fc^QE 



(21) 



fe=i 



The "one-shot" region Cq^{N) is the union of the regions 
Cqe,^(-^)' '^^^'"^ Cqe,<t(-^) is the set of all Q,£; > 0, such 



that 



Q<-I{AX;B)„, 
Q <E + I{A)BX), 



The entropic quantities are with respect to the state a 



(22) 
(23) 

XABE 



in ( 15 1 with the restriction that the density p{x) is degenerate. 



and ( 23 1 are a special case of the 
ni andte) with C = 0. The other 



The bounds in (22 
respective bounds in 
bounds in Theorem [T] are looser than the bounds in ( [T2| ) and 
( [T3] l when C = 0. Observe that the region is a union of general 
pentagons (see the Q_E-plane in Figure |2] for an example of 
one of these general pentagons in the union). 

4} Classically-Enhanced Quantum Capacity: The follow- 
ing theorem gives the two-dimensional capacity region 
Ccq(-AA) for classically-enhanced quantum communication 
through a quantum channel N 

Theorem 5: The classically-enhanced quantum capacity re- 
gion Ccq{N) is given by 



OO 



(24) 



k=l 



The "one-shot" region Cqq{N) is the union of the regions 
4q,<t(-^)' where C^q.^IA/") is the set of all C, Q > 0, such 



that 



C 



Q<I{X-B)„- 
Q < I{A)BX% 



I{A)BX), 



(25) 
(26) 



The entropic quantities are with respect to the state u-^^^^ 
in ([B]). 



The bounds in (25 1 and (26i are a special case of the 
respective bounds in (13i and (14 1 with E — 0. Observe 



that the region is a union of trapezoids (see the CQ-plane 
in Figure |2] for an example of one of these rectangles in the 
union). 

The above characterization is a slightly improved charac- 
terization of the Devetak-Shor region from Ref. ||2l- Indeed, 
the one-shot, one-state region there was a union of rectangles 
given by the following set of inequalities: 



C<I{X-B),, 
Q < I{A)BX)„ 



(27) 
(28) 



These rectangles are inside the trapezoids above. Though, our 
characterization in ( p5p6] l is the same as theirs when we 
consider the union over all the one-shot, one-state regions. 



5) Entanglement-assisted Classical Capacity with Limited 
Entanglement: 

Theorem 6: The entanglement-assisted classical capacity 
region Cce{N) of a quantum channel M is 



C, 



CE 



W = U ^CcE 



(1) 



(29) 



fe=i 



The "one-shot" region C^g(A/^) is the union of the regions 
C^n^„{N), where 4e<t(-^) is the set of all C,E >0, such 



--CE.a 

that 



C < I{AX;B) 
C<I{X;B), 



I{A)BX)„ + E. 



(30) 
(31) 



where the entropic quantities are with respect to the state 

„XABE 



m 



15i 



The bounds in (BOj) and (BTJ are a special case of the 
respective bounds in \\2) and (|14|l with Q — Observe that 
the region is a union of general polyhedra (see the CE-plane 
in Figure |2] for an example of one of these general polyhedra 
in the union). 

The above characterization of the CE achievable region is 
again an improvement over the characterization in Refs. flTl, 
mi, lfT41 . It specifies a union of general trapezoids. The region 
in Refs. ifTTl . lfT2l . lfT4l was a union of general rectangles of 
the form: 



C<I{AX-B),, 
E>H{A\X),. 



(32) 
(33) 



These general rectangles are inside the above general trape- 
zoids (note that the bounds in ( (30pT| ) intersect at = 
H{A\X)a), but the regions coincide when we take the union 
over all the one-shot, one-state regions. 

V. The Converse Proof 

Our method for proving the converse of Theorem [T] is to 
apply standard entropic bounds that are available in Ref. |8|. 
We first recall the Fannes inequality for continuity of en- 
tropy, the Alicki-Fannes inequality for continuity of coherent 
information, and another inequality of the Fannes class for 
continuity of quantum mutual information. 

Theorem 7 (Fannes Inequality 4271/ )." Suppose two states 



and (T^ are close: 



< e. 



11^ 111 
Then their respective entropies are close: 

\H{A),-H{AU<e\og\A\ 



(34) 



I A I is the dimension of the system A and H2 (e) is the binary 
entropy function that has the property lim(:_j.o i?2(e) — 0. 

Theorem 8 (Alicki-Fannes Inequality I128]I }: Suppose two 
states p"^^ and a"^^ are close: 



AB\ 



< e. 



\\P - 111 
Then their respective coherent informations are close: 

\I{A)B)p - IiA)B),\ < 4elog \A\ + 2H2{e). 



(35) 
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- S,E 


+ 0). 






\\ AB_ AB\\ < 
1 r 111 — 


C + 2(5 


-s < 


-I(AM-B")u,, 
n 




(38) 


Then their respective quantum mutual informations are close: 




-S< 


-I{A)B''M)^ ^ 

71 




(39) 


|/(A;S)p-/(A;S),| <5elog|A|+3i/2(e). (36) 


C + Q 


-s< 


-(/(M;B")^4 

71 


-/(^)B"M)„) + i?, 


(40) 



Proof: The proof follows in two steps by applying Theo- 
rems |7] and [8] First, monotonicity of the trace distance under 
the discarding of subsystems implies that [jp'^ — ct'^Hj^ < e. 
Theorem |7] then applies. The corollary then follows from 
the equality I{A;B) = H {A) + I{A)B) and the triangle 
inequality. ■ 



Converse: Section III describes the most general EACQ 
protocol and this most general case is the one we consider in 
proving the converse. Suppose Alice shares the maximally en- 
tangled state <i)^"^i with the reference system R (the protocol 
should be able to transmit the entanglement in state with 
e-accuracy if it can approximately transmit the entanglement 
with system R for any pure state on R and Ai). Alice also 
shares the maximally entangled state $^aTb ^j^j^ Alice 
combines her system Ai of the quantum state $^"^1 with her 
system Ta of the state and the classical register M that 

contains her classical information. The most general encoding 
operation that she can perform on her three registers M, 
Ai, and Ta is a conditional quantum encoder ^^^^iTa-^a 
consisting of a collection {S^^'^^^^ }m of CPTP maps. For 
now, we assume this general form of the encoder but later 
show in Appendix |E] that it is only necessary to consider 
a collection of isometrics. Each element S^'^^^^^ " of the 
conditional quantum encoder produces the following state: 



AA'"^E'Tb 



u, 



AiTa^A'"E' ^^RAi 



where we consider the isometric extension [/, 



AiTa'^A"^E' 



of 



each element S^^^"^^^"^ . The average density operator over 
aU classical messages is then as follows: 



Alice sends the A'" system through the noisy channel 
[/j^ "^^"^", producing the following state: 



jMRB"E"E'Tb 

1 



(37) 



Define A = RTb so that the state in ( 37 1 is a particular n* 
extension of the state in ( [T5| ). The above state is the state at 
time t in Figure [T] Bob receives the above state and performs 
a decoding instrument Tb^BiBeM ^ -pj^g protocol ends at 
time tf. Let (^^'■^'^'^^^^^eME e ^^^^^ ^.^^^ ^^^^^ 

Bob processes uj^-^^-^ e e Tb ^jj-jj j-jjg decoding instrument 
T)B"Tb^b,BbM ^ Suppose that an {n,C - 6, Q - 6, E + 6, e) 
EACQ protocol as given above exists. We prove that the 
following bounds apply to the elements of its rate triple 



for any e,6 > and all sufficiently large n. In the ideal case, 
the identity quantum channel acts on system Ai to produce 
the maximally entangled state $^^1. So for our case, the 
following inequality 



RBi 



BBi 



< e 



(41) 



holds because the protocol is e-good for quantum communica- 



tion according to the criterion in ( 10 1. Also, in the ideal case, 
the identity classical channel acts on system M to produce the 



maximally correlated state <i)^^*^ where 

rMM _ 1 
" |M 



m 

So for our case, the following inequaUty 

, /.MM 
(OJ ) 



MM 



< e 



(42) 



(43) 



holds because the protocol is e-good for classical communica- 
tion according to the criterion in (j9]). We first prove the upper 
bound in ( [38] l on the classical and quantum rates. Shor's ver- 
sion lfT2l of the entanglement-assisted classical capacity theo- 
rem ijTOl, flm states that the rate I{AM ; B")/?t, is achievable 
and serves as a multi-letter upper bound. This bound implies 
that the unlimited entanglement-assisted quantum capacity is 
I{AM; B^)/2n. If it were not so, then one could convert all 
of the quantum communication to classical communication by 
super-dense coding and beat the rate I{AAI; B'"') /n. But this 
result contradicts the optimality of the unlimited entanglement- 
assisted classical capacity. These two results imply the bounds 
C < I{AM;B")/n and 2Q < I{AM;B")/n. But we can 
go further and prove that the sum rate is bounded as well. 
Suppose there exists a protocol that beats the sum rate in 



(38 1. With more entanglement, one could convert all of the 
quantum communication to classical communication by super- 
dense coding. But this result again contradicts the optimality 
of the unlimited entanglement-assisted classical capacity. So 
the bound C+2Q-S < I{AM; holds. We next prove 

the upper bound in ([39| on the quantum communication rate: 



n{Q-S) 
= I{R)Bi)^dBi 
<I{R)Bi)^,+inQe + H2{e) 

< I[R)BiM)^, + 4nQe + H2{e) 

< I{R)B"TbM)^ + AnQe + i72(e) 

< I{RTb)B''M)^ + H{Tb\M)^ + AnQe + H^ie) 

< I{A)B''M)^ ^nE + 4nQe + H2{e). (44) 

The first equality follows by evaluating the coherent informa 
tion for the state $^^1. The first inequality follows from (41 



9 



and the Alicki-Fannes inequality in Theorem |8] The second 
inequality is from strong subadditivity, and the third inequality 
is quantum data processing. The fourth inequality follows 
because H{Tb\B"1\I) < H {Tb\M) (conditioning reduces 
entropy). The last inequality follows from the definition A = 
RTr and the fact that H{Tb\M)^ < nE. The inequahty in 
(|39i follows by redefining 6 as S' = S + AQe + We 



prove the upper bound in (40 1 on the classical and quantum 
rates: 



/(M; M)-MM 



n{C + Q-6) 

I{R)Bi)^aBi 

< I{M; M)^, + I{R)Bi)^, + bnCe + AnQe + 5iJ2(e) 

< /(M; B"Tb)^' + I{R)B''TbM)^, + n5' 

= I{M- B^)^, + / (Af ; Tb\B^)^, + H {B''Tb\M)^, 

- H{RB'^Tb\M)^, +n6' 

= I{M-B^)^,+HITb\B^)^, 

+ H (5"|Af)^, - H{RB''Tb\M)^, + nS' 

< I(M- B")^, + H {Tb)^, + I (rTb)B''AI)^^ + n5' 
= I{M; B")^. + / {A)B''M)^., + nE + n6' . 

The first equality follows because the mutual information 

I{M; M) of the maximally correlated state <I)^^^^ is equal 
to nC. The first inequality follows by applying ( [43] l and 
Corollary [l] to the mutual information I{M;M), and (41 1 
and the Alicki-Fannes' inequality to the coherent information 
I{R)Bi). The second inequality follows by applying the 
quantum data processing inequality and strong subadditivity 
as we did in the proof of the previous bound and by defining 
5' = 5Ce + AQe + 5H2{e)/n. The second and third equalities 
follow by manipulating entropies. The third inequality follows 
from the definition of coherent information and because con- 
ditioning does not increase entropy. The last inequality follows 
from the definition A = RTb and because nE is the maximal 
value that H [Tb) can take. ■ 

VI. The Direct Coding Theorem 

In this section, we prove the direct coding theorem for 
entanglement-assisted communication of classical and quan- 
tum information by giving a combination of strategies that 
can achieve the rates in Theorem [T] The most important devel- 
opment is the introduction of the classically-enhanced father 
protocol and its corresponding proof in the next section. This 
protocol yields a corner point in the achievable region (see, 
for example, the point labeled CEF in Figure [2|l. Section VI-F 
shows that combining this protocol with teleportation, super- 
dense coding, and entanglement distribution allows us to 
obtain all other corner points of the achievable rate region. 
Thus, this protocol is the most general one available for the 
channel coding scenario. 

A. The Classically-Enhanced Father Protocol 

We can phrase the classically-enhanced father protocol as a 
resource inequality (see Ref. fT?) for the theory of resource 



inequalities): 
1 



-I{A-E\X)^[qq] 



> -I {A; B\X)^ [q^q]+I {X; B)^ [c ^ c] 



(45) 



The precise statement of the classically-enhanced father re- 
source inequality is a statement of achievability. For any 
e, (5 > and sufficiently large n, there exists a protocol 
that consumes n uses of the noisy channel and 
consumes w nl {A]E\X)^ /2 ebits. In doing so, the protocol 
communicates w nl {A\B\X)^ /2 qubits with 1 — e fidelity 
and w nl {X\ B)^ classical bits with e probability of error 
The entropic quantities are with respect to the state u-^^be 
in ([15]). 

The proof of the achievability of the classically-enhanced 
father protocol proceeds in several steps. We first establish 
some definitions relevant to an entanglement-assisted quantum 
code, or father code for short, and recall the direct coding theo- 
rem for entanglement-assisted quantum (EAQ) communication 
Id, lfT4l . ifTSl . We then define a random father code, give a 
few relevant definitions and properties, and prove a version 
of the EAQ coding theorem that applies to random father 
codes. In particular, we show random father codes exist whose 
expected channel input is close to a product state (similar to 
result of the random quantum coding theorem in Appendix 
D of Ref |f6|). We follow this development by showing how 
to "paste" random father codes together so that the expected 
channel input of the pasted random code is close to a product 
state containing a classical message. A random classically- 
enhanced father code is then a collection of "pasted" father 
codes. The closeness of each expected channel input to a 
product state allows us to apply the HSW coding theorem 
im, Q so that Bob can decode the classical message while 
causing almost no disturbance to the encoded quantum infor- 
mation. Based on the classical message. Bob determines which 
random father code he should be decoding for. This method 
of efficiently coding classical and quantum information is the 
"piggybacking" technique introduced in Ref. |7| and applied 
again in Refs. iBSl . ll29l . The final arguments consist of a 
series of Shannon-theoretic arguments of derandomization and 
expurgation. The result is a deterministic classically-enhanced 
father code that performs well and achieves the rates in the 
capacity region in Theorem [T] 



B. Father Codes 

The unencoded state of a father code is as follows 

where 



(46) 



k=l 



\<^f^^" = -L^ y |m)^-^ \m) 

^ m—1 



Tb 
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\(t>k,m)^ — £^ 

where the states \(t>k,-. 



The isometric encoder g^^^A^A father code maps 

kets on the systems Ai and Ta as follows 

) ^ are mutually orthogonal. Therefore, 
the encoder g^^'^A^A jj^^pg j-jjg unencoded state in (46 1 to 
the following encoded state: 



1$ 



TaTe 



fc=i 



in the following defini- 



where we define the states 
tion. 

Definition 1: The set C 
of the father code. The EAQ codewords are as follows: 



{\4'k)^ is a representation 



A'"Tb 



1 



2"" 



\4>k.r. 



m) 



(47) 



The EAQ code density operator '^'^ (C) is a uniform 
mixture of the EAQ codewords: 



(C) 



1 



2"« 
/c = l 

A'" 



The channel input density operator p (C) is the part of the 
code density operator (C) that is input to the channel: 

p^'" (C)^Trr,{/'"^MC)}. 
The above definitions imply the following two results: 



A'^Tb 



AiTa^A'" 



(C) 



1 



EE 



'^k,m 



I A'" 



2«(Q+S) 

k=l m=l 

The direct coding theorem for entanglement-assisted quan- 
tum communication gives a method for achieving the multi- 
letter quantum communication rate and entanglement con- 
sumption rate. 

Proposition 1 (EAQ Coding Theorem): Consider a quan- 
tum channel and its isometric extension -^be ^ 
any e, (5 > and all sufficiently large n, there exists an (n, e) 
entanglement-assisted quantum code defined by isometrics 
(£,!?), such that the trace distance between the actual output 



and the ideal decoupled output 



¥B"E" 



AiTa 



TaTp, 



Be 



(^--"^(K)/3 (48) 

is no larger than e, for any state ip^^^ with dimension 
2"<3 in the system Ai and any maximally entangled ^'^^Tb 
equivalent to nE ebits. The rate of quantum communication 
is Q — (5 = -^liA; B)^ provided that the rate of entanglement 
consumption is at least E + 5 ^ h^{A\ E)^. The entropic 
quantities are with respect to the following state: 

\A\ABE _ jj^^BE^IAAA' 



(49) 



where IV')^^ is the purification of some state p"^ . 
Proof: See Ref. ifBI . 



C. Random Father Codes 

We cannot say much about the channel input density op- 
erator " (C) for a particular EAQ code C. But we can say 
something about the expected channel input density operator 
of a random EAQ code C (where C itself becomes a random 
variable). 

Definition 2: A random EAQ code is an ensemble {pc,C} 
of codes where each code C occurs with probability pc- The 
expected code density operator p"^ is as follows: 



-A'"Ti5 



c{p-^'"^MC)} 



The expected channel input density operator is as fol- 

lows: 



P^'" = Ec 



A random EAQ code is "p-like" if the expected channel input 
density operator is close to a tensor power of some state p: 



A'" 



< e. 



p - p- 

It follows from the above definition that 

P^'"^^=E^^c/'"^MC), 



(50) 



p^'" = 



c 



We now state a version of the direct coding theorem that 
applies to random father codes. The proof shows that we can 
produce a random father code with an expected channel input 
density operator close to a tensor power state. 

Proposition 2: For any e,S > and all sufficiently large 
n, there exists a random p"^ -like EAQ code for a channel 
A/^'^ . In particular, the random EAQ code has quan- 
tum rate ^I{A;B)^ — S and entanglement consumption rate 
|/(j4; E)^ + S. The entropic quantities are with respect to the 

state in (49 1 and the state p'^ is that state's restriction to the 
system A'. 

Proof: The proof is in the Appendix |A] ■ 

D. Associating a Random Father Code with a Classical String 

Suppose that we have an ensemble {p (x) , Px}.j;^x °f 
quantum states. Let a;" = xi ■ ■ ■ Xn denote a classical string 
generated by the density p (x) where each symbol Xi S X. 
Then there is a density operator p^^^ corresponding to the string 
a;" where 



Suppose that we label a random father code by the string a;" 
and let p;^„ denote its expected channel input density operator. 

Definition 3: A random father code is (pj;n )-like if the 
expected channel input density operator p^„ is close to the 
state px^: 



A'" 



< e. 



Proposition 3: Suppose we have an ensemble as above. 
Consider a quantum channel with its isometric ex- 

tension U^^^^ . Then there exists a random (p2.n)-like 



11 



entanglement-assisted quantum code for the channel 
for any e, ^ > 0, for all sufficiently large n, and for any 
classical string a;" in the typical set [.30i . Its quantum 

communication rate is I{A;B\X)/2—c'S and its entanglement 
consumption rate is I{A; E\X) /2 + c"S for some constants 
c', c" where the entropic quantities are with respect to the state 
in (15 1 with a trivial system E' . The state p^^ is generated 
from the restriction of the ensemble {p [x] , (j)^^ jj-g x to the 
A' system. The states <f)^"^ in the ensemble correspond to the 



states 6^^' in (115 i 



Proof: The method of proof involves "pasting" random 
father codes together. The proof is in the Appendix |B] ■ 

E. Construction of a Classically-Enhanced Father Code 

The HSW coding theorem gives an achievable method for 
sending classical information over a noisy quantum channel. 
The crucial property that we exploit is that it uses a product- 
state input for sending classical information. This tensor- 
product structure is what allows us to "piggyback" classical 
information onto father codes. 

Proposition 4 (HSW Coding Theorem jTp, (^): Consider 
an input ensemble {p (x) , } that gives rise to a classical- 
quantum state where 



XB 



pix)\x) {xf ^M^'^^ip^') 



A'^B/ A'\ 



Let C — I{X; B)^ — c'5 for any 5 > {) and for some constant 
c'. Then for all e > and for all sufficiently large n, there 
exists a classical encoding map 



/ : [2"^] ^ T, 



and a decoding POVM 



(aD 



me 2" 



that allows Bob to decode any classical message m S [2"*-^] 
with high probability: 

Tr{T„f Af }> 

The density operators t^" are the channel outputs 



iPf{m))^ 



(51) 



and the channel input states are a tensor product of states 
in the ensemble: 



Px[- 



We are now in a position to prove the direct coding part of 
the classically-enhanced father capacity theorem. The proof is 
similar to that in Ref. I?). 

Direct Coding Theorem: Define the classical mes- 
sage set [2"'-^] , the classical encoding map /, the channel 
output states , and the decoding POVM A^ as in 
Proposition |4] Invoking Proposition [3] we know that for 
each m e [2"*-^], there exists a random (/9^^^^)-like fa- 
ther code Cm whose probability density is pc,„- The ran- 
dom father code Cm has encoding-decoding isometry pairs 



c,"^"'^^'^'") for each of its reahzations. It 



transmits n[I{A; B\X)/2 — c'S] qubits provided Alice and Bob 
share at least n[I{A; E\X)/2 + c"S] ebits. Let C denote the 
random classically-enhanced father code that is the collection 
of random father codes {Cm}me[2"'^]- ^^^^ prove that the 
expectation of the classical error probability for message m 
is small. The expectation is with respect the random father 
code Cm- Let denote the channel output density operator 
corresponding to the father code C,„: 

Tc^;; EE TrT, (f^f ""^'"(^-"^ ® ci>^-^-)) } . 

Let denote the expected channel output density operator 
of the random father code Cm'- 



Er 



The following inequality holds 



Pf{va 



< \X\e 



because the random father code Cm is (p^(-^^)-like. Then the 
expected channel output density operator is close to the 
tensor product state t,^ in 



(0: 



< \X\e, 



(52) 



because the trace distance is monotone under the quantum 
operation J^^'^^b"- , it then follows that the POVM element 
A^j has a high probability of detecting the expected channel 
output density operator : 



Tr{A, 



7n ni 



T„, } > Tr{A„ T,„ } 
> l-€-\X\e. 



-B" 



(53) 

The first inequality follows from the following lemma ETIl 
that holds for any two quantum states p and a and a positive 
operator 11 where < 11 < /: 

Tr{np}>Tr{na}-||p-a||i. 

The second inequaUty follows from Proposition |4] and ( [52| . 
We define Bob's decoding instrument dB^Tb^b^BeM 
random classically-enhanced father code C as follows: 



for the 



V, 



B"Tb^BiBeM 
C 



B^Tb —>-Bi Be 



B"Tb 



A£' 



I A/ 



where Tb^BiBe decoding isometry for the father 

code C,„ "and each map (^V^pV'A^) is 

trace reducing. The induced quantum operation corresponding 
to this instrument is as follows: 



X)^"Tb^B,Be 



Tr 



V. 



B"Tb^BiBeM 
C 



(P)} 



Let pe {Cm) denote the classical error probability for each 
classical message m of the classically-enhanced father code 
C: 

Pe {Cm) = 1 - Pr{A/' = m\M = m} 

'\B^^Tb^Bi Bi 



= 1 - Tr <^ 2?, 



m '^C^ ' 



^Ar 
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Then by the above definition, ( [53| , and the fact that the trace is invariant under isometry. The second equahty follows be- 
cause the operator fl^'^^^ is positive. The third equality 
follows from some algebra, and the second inequality follows 



does not change under the isometry T>^ Tb^BiBe ^ jj j^qJ^j^ 
that the expectation of the classical error probability {Cm) 
with respect to the random father code Cm is low: 



Ec„,{Pe [Cm)} < (I + IA-I). 



(54) 



We now prove that the expectation of the quantum error is 
small (the expectation is with respect to the random father code 
Cm)- Input the state (g)*'^-*'^^ to the encoder S^^^^^"^'" 
followed by the channel "^s" -pj^g resulting state is an 
extension f^cf''^" '^cl'- 



1 (^T^aTi 



^RTbB" 



Let ilm denote the expectation of 
to the random code Cm- 



RTbB'^ 



with respect 



It follows that ^^^^^ is an extension of Tm 
inequality follows from p3|): 



The following 



Tr{a 



j^RTbB^ B 



Af„ } > 1 - (1 + 1^-1)6. 



(55) 



The above inequality is then sufficient for us to apply a 
modified version of the gentle measurement lemma (Lemma [T] 
in the Appendix [C]) so that the following inequality holds 



A^" ~ n 



RTbB" 



\X\)e. 



(56) 



Monotonicity of the trace distance gives an inequality for the 
trace-reducing maps of the quantum decoding instrument: 



V. 



B^^Tb^Bi Be 



< y8(r 



\X\)e. 

The following inequality also holds 



(57) 



E^ 



< Er: 



V. 



B^^Tb^Bi Bi 
C 



^B^^Tb -^B\ Be 



E 



^B^Tb^BiI 



Y: Tr{A5<^-^"} 



= i-Trurnr^^^ 



< (1 + 1^-1)6. 



(58) 



The first inequality follows from definitions and the triangle 
inequality. The first equality follows because the trace distance 



from (53 1. The fidelity of quantum communication for all 
classical messages m and codes Cm is high 



F ( J^^^^b^B\Be 



RBi 



> 1 -e 



because each code Cm in the random father code is good for 
quantum communication. It then follows that 



V, 



B^^Tb^BiBe 



(f^lf-^") <2yi (59) 



because of the relation between the trace distance and fidelity 



[21]. Application of the triangle inequality to (59i, (58 1, and 



( 57 1 gives the following bound on the expected quantum error 

Ec,„ {qe (Cm)} < e (60) 

where 



e' = (1 + \X\)e + v/8(l + |A'|)e + 2^~e, 

and where we define the quantum error [Cm) of the code 
Cm as follows: 



qe (Cn 



T^B^Tb- 



RBi 



The above random classically-enhanced father code relies 
on Alice and Bob having access to a source of common 
randomness. We now show that they can eliminate the need for 
common randomness and select a good classically-enhanced 
father code C that has a low quantum error (C,„) and low 
classical error pe {Cm) for all classical messages m in a large 
subset of [2"*^] . By the bounds in (54i and (60 1, the following 



bound holds for the expectation of the averaged summed error 
probabilities: 



Er 



1 



2nC 



Pe {Cm)+qe {Cm) > <e' + (1 + 1^-1 



If the above bound holds for the expectation over all random 
codes, it follows that there exists a particular classically- 
enhanced father code C = {Cm}mel2"<^] ^^'-^ following 
bound on its averaged summed error probabilities: 

^J2p- (^™) + 1- (^™) < + (1 + l-^De- 

We fix the code C and expurgate the worst half of the father 
codes — those father codes with classical messages m that 
have the highest value of p^ {Cm) + qe {Cm)- This derandom- 
ization and expurgation yields a classically-enhanced father 
code that has each classical error pe {Cm) and each quantum 
error q^ {Cm) upper bounded by 2 (e' + (1 + |'-t'|)e) for the 
remaining classical messages m. This expurgation decreases 
the classical rate by a negligible factor of ^. ■ 
Note that the above proof is a scheme for entanglement 
transmission. This task is equivalent to the task of subspace 
transmission (quantum communication) by the methods in 
Ref. Ell. 
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F. Child Protocols 

We detail five protocols that are children of the classically- 
enhanced father protocol in the sense of Ref. fT?]. Recall the 



classically-enhanced father resource inequality in (45 1. Recall 
the three respective unit resource inequalities for teleportation, 
super-dense coding, and entanglement distribution: 



2[c- 

[m] 



c] + [qq] > [g ^ 
[q^ q]> [qq] 



q]. 



(61) 
(62) 
(63) 



We can first append entanglement distribution to the 
classically-enhanced father resource inequality. This append- 
ing gives rise to the classically-enhanced quantum commu- 
nication protocol in Ref. Q. The development proceeds as 
follows: 

{M^'^^) + \l{A,E\X) [qq] 
> h {A; B\X) [q^q]+I {X; B) [c ^ c] 



1 



/ {A- E\X) + I {A)BX) ][q^q]+I {X- B) [c ^ c] 



> -/ {A; E\X) [qq] + / {A)BX) [q ^ q] + I {X; B) [c ^ c] , 

where the first inequality is the classically-enhanced father 
resource inequality, the first equality exploits the identity in 
and the last inequality follows from entanglement distri- 
bution. By the cancellation lemma (Lemma 4.6 of Ref. lfT4l ). 
the following resource inequality holds 

(A/-^'^^) + o [qq] > I {A)BX) [q^q]+I (X; B) [c ^ c] , 

(64) 

where a \qq\ represents a sublinear amount of entanglement. 
The above resource inequality is equivalent to the classically- 
enhanced quantum communication protocol in Ref. |7| (mod- 
ulo the sublinear entanglement). Combining the above re- 
source inequality further with entanglement distribution gives 
the classically-enhanced entanglement generation protocol 
from Ref. Q: 

(AA^'^^) + [qq] > I {A)BX) [qq] + / (X; B) [c ^ c] . 

We can combine the classically-enhanced father protocol 
with super-dense coding and entanglement distribution. Let 
CEF-SD-ED denote the resulting protocol. The development 
proceeds by first using qubits at a rate {A[X) for entan- 
glement distribution: 

{U^'^^) + \l{A-,E[X)[qq] 
>\l{A-B[X) [q- 



\h{A[X) + \i{A)BX 



q]+I{X:B) [c->c] 

[q^q]+I{X-B) [c^c] 



> Ih{A[X) [qq] 



1 



/ {A)BX) [q^q]+I {X; B) [c c] 



After this step, the above protocol is equivalent to the follow- 
ing one: 

{N)+o [qq] > h {A)BX) {[qq] + [q ^ q])+I (X; B) [c ^ c] 



SO that it has generated entanglement at a net rate of 
\l[A)BX) ebits. We can then further combine with super- 
dense coding to achieve the protocol CEF-SD-ED: 



A'-fB\ 



o [qq] > I {A)BX) [c ^ c] + I {X; B) [c - 



We can combine the classically-enhanced father protocol 
with super-dense coding to get Shor's entanglement-assisted 
classical (EAC) communication protocol lfT2l : 

{N^'^^)+H{A[X) [qq] 



■A'^B\ 



= {N- 

> IhA;B[X) [q- 
+ ^I{A;B[X) [qq] 



l-I{A;E[X) [qq] + l-I{A;B[X) [qq] 



q]+I{X;B) [c^c] 



>I{X;B) [c^ 
= / {AX; B) [c 



c]+I{A;B[X) [c-^ c] 



(65) 



The first equality uses the identity in ([T]). The first inequality 
uses the classically-enhanced father resource inequality. The 
second inequality uses super-dense coding, and the last equal- 
ity uses the chain-rule identity in Q. The above rates are the 
same as those in Refs. llT2l . ||T4| . 

Teleportation is the last unit resource inequality with which 
we can combine the classically-enhanced father protocol. 
Let CEF-TP (classically-enhanced father combined with tele- 
portation) denote the resulting protocol. Consider that the 
classically-enhanced father protocol generates classical com- 
munication at a rate / {X\ B). Alice and Bob can teleport 
quantum information if they have an extra / {X\ B) /2 ebits 
of entanglement. The development proceeds as follows: 



{N 



A'^B- 



+ ^I{A;E[X)[qq] + ^I{X;B) [qq] 



I{X;B) [c 
1 



c 



\l{X;B) [qq] 



I{X;B) [q^q] 



> -I{A;B[X) [q^q] 

> \l{A;B[X) [q^q] 
^^I{AX;B) [q^q]. 

We apply teleportation to get the second inequality and the 
chain rule in Q to get the last equality. We can rewrite the 
above protocol as follows: 



{M^'^B) + l{I{A;E[X)+I{X:B))[qq] 

> ll{AX;B) [q^q]. 

This protocol is the same as the father protocol if random 
variable X has a degenerate distribution. 

VII. Single-Letter Examples 

Theorem[T]is a general theorem that determines the capacity 
region of any entanglement-assisted channel for classical and 
quantum communication. Unfortunately, the theorem is of a 
multi-letter nature, implying that it is an intractable problem 
to compute the capacity region corresponding to an arbitrary 
channel. 
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In the forthcoming subsections, we provide several exam- 
ples of channels for which we can exactly compute their 
corresponding capacity regions. The first example is the trivial 
completely depolarizing channel (the channel that replaces the 
input state with the maximally mixed state). We find this ex- 
ample interesting despite its triviality because it coincides with 
our results in Ref. ||26| . The second example is the quantum 
erasure channel f32\. The advantage of the quantum erasure 
channel is that we can apply simple reasoning to determine 
the outer bound of its corresponding capacity region. We then 
show that the inner bound corresponding to the achievable 
region of this channel matches the outer bound. Thus, we know 
the full capacity region for the quantum erasure channel. The 
final channel that we single-letterize is the qubit dephasing 
channel. Perhaps surprisingly, we are able to do so by arguing 
that the Devetak-Shor CQ region and the Shor CE region each 
single-letterize. 

A. The Completely Depolarizing Channel 

The first single-letter example that we consider is the 
completely depolarizing channel. This channel simply replaces 
the input state with the maximally mixed state. Therefore, no 
classical or quantum information can traverse it, even with the 
help of entanglement. 

Corollary 2: The following set of inequalities specifies the 
entanglement-assisted capacity of the completely depolarizing 
channel: 

C + 2Q < 0, 
Q<E, 
C + Q<E. 

Proof: The proof follows by considering that the mutual 
information I{AX; B) and the Holevo information I{X\ B) in 
Theorem[T]vanish for any fc-qudit state transmitted through the 
completely depolarizing channel and the coherent information 
is either negative or zero for any input state. Then the 
inequalities ( T2p4 i there become the respective inequalities 
above. ■ 
One should observe that the region is actually trivial (it is 
empty) because C + 2Q < 0. Nevertheless, we still find the 
inequalities in Corollary [2] interesting because they coincide 
with those that we found in Ref. |26| for the "unit resource 
capacity" regiorj^ (modulo a different sign convention with 
the entanglement rate E). The proof techniques in Ref. ||26| 
involve reductio ad absurdum arguments that show how points 
outside the region conflict with physical law, whereas the 
arguments in the converse proof of Theorem [T] are information 
theoretic. One should expect that the set of inequalities in 
Corollary |2] coincide with those for the unit resource capacity 
region because having access to the completely depolarizing 
channel is equivalent to having no quantum channel at all — 
Bob can actually simulate this resource locally merely by 
preparing the maximally mixed state in his laboratory. 

'^The unit resource capacity region is the set of rates tliat are achievable 
without the aid of a noisy resource. 



Capacity 


Rate Triple (C, Q, E) 


Entanglement-assisted classical capacity (EAC) 


(2(l-e),0,l) 


Quantum capacity (LSD) 


(0, 1 - 2e,0) 


Classical capacity (HSW) 


(1-6,0,0) 


Entanglement-assisted quantum capacity (EAQ) 


(0,l-e,e) 



TABLE I 

The left column gives a particular type of capacity for the 
quantum erasure channel, and the right column gives the 
corresponding optimal rate triple. 



B. The Quantum Erasure Channel 

The quantum erasure channel is perhaps one of the simplest 
noisy quantum channels 1321 . because it has a simple speci- 
fication and its known transmission capacities admit simple 
formulas ll33l . A quantum erasure channel passes the input 
state along to the environment and gives Bob an erasure state 
|e) with probability e. It passes the input state along to Bob 
and gives the environment an erasure state |e) with probability 
1 — e. It induces the following map on a density operator : 



P^' ^ (1 



e e 



and its isometric extension acts as follows: 



1^) 



AA' 



where IV')^^ is some purification of p"^ . 

Table [l] lists the known optimal transmission capacities 
for the quantum erasure channel. Bennett et al. determined 
the classical capacity of the quantum erasure channel with 
an intuitive argument (the outer bound exploits the Holevo 
bound m and the inner bound uses an encoding with or- 
thogonal states), and they determined its quantum capacity 
with a different intuitive argument (the well-known no-cloning 
argument combined with linear interpolation for the outer 
bound and one-way random hashing for the inner bound 
||33]| ). The optimality of the classical rate 2 (1 — e) and the 
quantum rate 1 — e of an entanglement-assisted quantum 
erasure channel follows from the arguments in Ref. |10|. The 
optimality of the respective entanglement consumption rates 
follows from our forthcoming arguments. Finally, note that 
we can obtain the quantum capacity result by combining the 
father protocol (entanglement-assisted quantum communica- 
tion) with entanglement distribution at a rate e, and we can 
obtain entanglement-assisted classical communication from 
entanglement-assisted quantum communication by consuming 
all of its quantum communication at rate 1— e with super-dense 
coding. 

Corollary [3] below shows that the CQE capacity region of a 
quantum erasure channel admits a simple characterization in 
terms of three inequalities. We prove the converse by intuitive 
reasoning that one would perhaps expect to be able to apply to 
the quantum erasure channel, given earlier intuitive reasoning 
that authors have applied to this channel. We prove the direct 
coding theorem by giving an explicit ensemble that reaches 
afl of the bounds in the inequalities in Corollary |3] The result 
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is that time-sharing between the four protocols in Table |l] is 
the optimal coding strategy. 

Corollary 3: Suppose a quantum erasure channel has an 
erasure probability e. The following set of inequalities specifies 
the capacity region of this entanglement-assisted channel for 
transmitting classical and quantum information: 



C + 2Q<2(l-e), 
^'^^C + Q<E+{l~2e), 
<{l-e){l + E). 



1 - e 

C + + 



(66) 
(67) 
(68) 



Proof: We first prove the converse. The first bound in ( |66| ) 
holds because the sum rate C+2Q can never exceed 2 (1 — e). 
Otherwise, one could beat the entanglement-assisted classical 
capacity by dense coding or one could beat the entanglement- 
assisted quantum capacity by teleportation. We next consider 



the second bound in (67 i. We first prove that time-sharing 
between the HSW point and the LSD point is an optimal 
strateg}]^ and then show that this result implies the bound in 
( [67] i. Consider a scheme of quantum error correction for an 
erasure channel with erasure parameter e. If Alice transmits n 
qubits, then Bob receives e) of these and the environment 
receives ne of them (for the case of large n). From these 
n(l — e) physical qubits. Bob can perform a decoding to 
obtain n(l — 2e) logical qubits, by the quantum capacity result 
for the erasure channel. This implies an optimal "decoding 
ratio" of n{l — 2e) decoded qubits for the n{l — e) received 
qubits: (1 — 2e)/(l — e). Now let us consider a Devetak- 
Shor-like code for the erasure channel. Suppose that Alice 
can achieve the rate triple (A(l - e), (1 - A)(l - 2e) -I- 6, 0) 
where S is some small positive number (so that this rate 
triple represents any point that beats the time-sharing limit). 
Now if Alice transmits n qubits. Bob receives n{l — e) 
of them and the environment again receives ne of them. 
But this time. Bob performs measurements on nA(l — e) 
of them in order to obtain the classical information. Thus, 
these qubits are no longer available for decoding quantum 
information because the measurements completely dephase 
them. This leaves n(l — e) — nA(l — e) = n(l — A)(l — e) 
qubits available for decoding the quantum information. If 
Bob could decode n{(l — A)(l — 2e) + 6) logical qubits, 
this would contradict the optimality of the above "decoding 
ratio" because - A)(l - 2e) + S)/{n{l - A)(l - e)) = 
(1 - 2e)/(l - e) + d/{l - A)(l - e) is gi-eater than the 
optimal decoding ratio (1 — 2e)/(l — e). Therefore, he must 
only be able to decode n{l — A)(l — 2e) logical qubits. 
This proves that time-sharing between HSW and LSD is an 
optimal strategy for the quantum erasure channel. Now, the 
capacity region excludes any point lying above the CQ-plane 

^Time-sharing is a simple metliod of combining coding strategies (30|. As 
an example, consider the case of time-sharing a channel between an (n, Qi, e) 
quantum code and another {n,Q2,e) quantum code. For any A where < 
A < 1, the sender uses the first code for a fraction A of the channel uses and 
uses the other code for a fraction (1 — A) of the channel uses. This time- 
sharing strategy produces a quantum code with rate AQi -I- (1 — A) Q2 and 
en'or at most 2e. Time-sharing immediately gives that the convex hull of any 
set of achievable points is an achievable region. 

^Devetak and Shor stated (but did not explicitly prove) that time-sharing 
between HSW and LSD is optimal for the erasure channel (7). 



with which we can combine with entanglement distribution 
to reach a point on the CQ-plane outside the Devetak-Shor 
time-sharing bound (otherwise, we would be able to beat the 
time-sharing bound between HSW and LSD by combining 
this point with entanglement distribution). In particular, this 
means that achievable points cannot be outside the plane 
containing the vector connecting LSD to HSW and the vector 
of entanglement distribution. It is straightforward to calculate 
the equation for this plane. The vector connecting LSD to 
HSW is 

(0, 1 - 2e, 0) - (1 - e, 0, 0) = (- (1 - e) , 1 - 2e, 0) . 

The vector of entanglement distribution is (0, — 1,— 1). A 
normal vector for the plane containing the two vectors is 

1 - 2e 

,-1,1 



1 

Then the equation for the plane is 

1 - 2e 



1 



(C - (1 - e)) - Q + E = 0, 



implying that achievable points must obey the bound in (67i 



because they cannot lie outside this plane. The above argument 
also shows that the EAQ rate triple (0, 1 — e, e) is optimal (in 
particular, that the entanglement consumption rate is optimal) 
because it lies at the intersection of the two bounds in (|66ll and 



(67 1. We now prove the last bound in (681 in three steps. We 
first prove that the entanglement consumption rate of the EAC 
protocol is optimal. We then prove that time-sharing between 
EAC and HSW is optimal, and finally rule out all points 
outside a plane containing the vector connecting EAC to HSW 
and the vector of super-dense coding. Consider the EAC rate 
triple (2 (1 — e) , 0, 1). The entanglement consumption rate of 
one ebit per channel use is optimal, i.e., one cannot achieve 
the classical rate of 2 (1 — e) with less than one ebit per 
channel use. The state that achieves capacity is the maximally 
entangled state |$^). The minimum amount of entanglement 
that this capacity-achieving state requires is H (A) = 1 ebit 
(we give a more detailed proof in Appendix |D]). Thus, no 
lower amount of entanglement could suffice for achieving 
the maximal classical rate. We now prove that time-sharing 
between EAC and HSW is optimal by an argument similar 
to the argument for our other time-sharing bound. Consider 
a scheme of entanglement-assisted classical communication 
for an erasure channel with erasure parameter e. If Alice 
transmits n qubits (that could potentially be entangled with 
n qubits of Bob's), then Bob receives n(l — e) of these and 
the environment receives ne of them (for the case of large 
n). From these n(l — e) physical qubits (and his halves of 
the ebits). Bob can perform a decoding to obtain n2(l — e) 
classical bits, by the entanglement-assisted classical capacity 
result for the erasure channel. This implies an optimal "EA 
decoding ratio" of n2(l — e) decoded bits for the n(l — e) 
received qubits: 2(1 — e)/(l — e). Now let us consider a Shor- 
like cod^ for the erasure channel. Suppose that Alice can 
achieve the rate triple (A(l - e) + (1 - A)2(l - e) + (5, 0, 1 - A) 
where S is some small positive number (so that this rate triple 

^"Shor-like" in the sense of Ref. |12|. 
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represents any point that beats the time-sharing limit). Now 
if AHce transmits n qubits, then Bob receives 7i(l — e) of 
them and the environment again receives ne of them. But this 
time. Bob performs some measurement on nA(l — e) of them 
in order to obtain some of the classical information. Thus, 
these qubits are no longer available for decoding any more 
classical information because they have already been decoded. 
This leaves n{l — e) — nX{l — e) = n{l — A)(l — e) qubits 
available for decoding the extra classical information. If Bob 
could decode n((l — A)2(l — e) + 6) extra classical bits, this 
would contradict the optimality of the above "EA decoding 
ratio" because - A)2(l - e) + 6)/{n{l - A)(l - e)) = 
2(1 - e)/(l - e) + (5/(1 - A)(l - e) is greater than the optimal 
decoding ratio 2(1— e) /(I — e). Therefore, he must only be able 
to decode n(l — A)2(l — e) classical bits. This proves that time- 
sharing between HSW and EAC is an optimal strategy for the 
quantum erasure channel. Now, the capacity region excludes 
any point lying to the right of the C£'-plane with which we can 
combine with super-dense coding to reach a point on the CE- 
plane outside the time-sharing bound (otherwise, we would be 
able to beat the time-sharing bound between HSW and EAC by 
combining this point with super-dense coding). In particular, 
this means that achievable points cannot be outside the plane 
containing the vector connecting HSW to EAC and the vector 
of super-dense coding. It is straightforward to calculate the 
equation for this plane. Consider that the vector between EAC 
and HSW is 

(2(l-e),0,l)-(l-e,0,0) = (l-e,0,l). 

The vector of dense coding is (2, —1, 1). A normal vector for 
this plane is 

(-l,-(l + 6),l-e). 
The equation for the plane is 

~{C~{l~e))-{l + e)Q+{l-e)E = 0, 



implying that achievable points must obey the bound in (68 1 
because they cannot lie outside this plane. We have now 
completed the proof of the outer bound. We prove the direct 
coding theorem. The simple way to prove it follows simply by 
time-sharing between the four protocols HSW, LSD, EAQ, and 
EAC, but it is interesting to explore a particular ensemble of 
states of the form ( [T5] l in Theorem [T] that achieves the capacity. 
We consider transmitting the A' system of the following 
classical-quantum state through the channel: 



XAA' 



(69) 



where 



i^o>^^'^v^ioo)^^' + yr^iii)^-^' 



sAA' 



v/r^ioo) 



AA' 



AA' 



and /i e [O, |]. This classical-quantum state is a purified 
version of the ensemble considered in Ref. 121 . We can 




EAQ 



Fig. 3. (Color online) The capacity region of the quantum erasure channel 
with erasure parameter e = 1/4. Planes I, 11, and III coirespond to the 
respective bounds in j66|68j. The optimal strategy is to time-share between 
classical coding (HSW), quantum coding (LSD), entanglement-assisted quan- 
tum coding (EAQ), and entanglement-assisted classical coding (EAC). The 
classically-enhanced father protocol does not give any improvement over time- 
sharing for a quantum erasure channel. 



evaluate various relevant entropic quantities for this state: 

H{B)^ = l-e + H,{e), 

H{A\X)^^H2{iJi), 

H iE\X)^ = eH2 (m) + H2 (e) , 

where the state a is the state resulting from sending the A' 
system through the erasure channel. It then follows that 





= (1 


- e) (1 - H2 ill)) , 


I{A)BX)^ 


= (1 


- 2e) ifi) , 


\l{A-B\X)^ 


= (1 




\l{A-.E\X)^ 


= (1 


- e) H2 ifi) , 


I{AX-B) 


-(1 





A quick glance over the above information quantities re- 
veals that exploiting coding strategies such as the classically- 
enhanced father protocol gives no improvement over time- 
sharing because H2 {n) varies between zero and one as ji 
varies between zero and 1/2 (the classically-enhanced father 
protocol gives exactly the same performance as time-sharing, 
as does the classically-enhanced quantum communication 
strategy of Devetak and Shor |7|). Thus, the region obtained 
as the union of the one-shot, one-state regions is indeed 
equivalent to the outer bound given above. Figure |3] plots this 
region for a quantum erasure channel with erasure parameter 
e = 1/4, demonstrating that this region is equivalent to the 
outer bound. ■ 

The following corollary applies to the noiseless qubit chan- 
nel by simply plugging in e = 0. 

Corollary 4: The following set of inequalities specifies the 
entanglement-assisted capacity of the noiseless qubit channel 



17 



for transmitting classical and quantum information: 

c + 2g < 2, 
C + Q<E + l. 

C. The Qubit Dephasing Channel 

In this section, we show that we can compute the full 
capacity region of a qubit dephasing channel and plot it in 
Figure]?] for a channel with dephasing parameter p = 0.2. We 
show also that the classically-enhanced father protocol can 
beat time-sharing for a qubit dephasing channel (the example 
is an extension of the argument in Ref. |71). 

1) Single-Letterization: We first show that the classically- 
enhanced father trade-off curve is optimal in the sense that it 
hes along the boundary of the capacity region for the qubit 
dephasing channel. A surprisingly simple argument proves this 
result by resorting to the result of Devetak and Shor in Ref. Q. 
There, they showed that the following trade-off curve in the 
CQ-plane is optimal: 



{{CcQ {fi),QcQ (Ai),0) :0</i< 1/2} 



(70) 



where 



CcQ (m) 

QcQ (m) 

9 {P, m) 



1 - H2 (m) , 

H2{^i)-H2{9{p,^l)).. 
1 



2 2V 2 



16.? 



1 



Now, consider the surface formed by the following set of 
points: 

{(CcQ (m) , QcQ {^J)+E,E)■.Q<^x< 1/2, E>{)}. 

(71) 

This surface is an outer bound for the capacity region (if it 
were not so, one could combine points outside this surface 
with entanglement distribution and beat the optimal bound in 
(|70jl for the Devetak-Shor case). 



Now consider sending the /i-parametrized ensemble in ( 69 1 



where /i S [0, 1/2], through the qubit dephasing channel with 
dephasing parameter p. It is straightforward to show that the 
various entropic quantities in the classically-enhanced father 
protocol are as follows for the /x-parametrized ensemble: 



CcEF (m) 
QCEF (m) 

EcE¥ (m) 



I{X-B 
1 



I{A-B\X) 



1 - H2 ill) , 

H2 (m) - 
1 



-H2 {giP.y)) : 



I{A■E\X) = -H2{9{p,^Ji)) 



Thus, the following set of points contains all points along the 
classically-enhanced father trade-off curve: 



{(CcEF (A^) : 



^CEF 



(Ai),ScEF(M)) :0<M< 1/2} 



All points along the classically-enhanced father lie along 
the boundary because they are of the form in (71 1 with 



E = H2 (g {p, 11)) /2. This proves that the points along the 
classically-enhanced father trade-off curve are optimal. One 



can also achieve any point along the surface in ( 7 1 1 with entan- 
glement consumption below the classically-enhanced father by 



combining the classically-enhanced father with entanglement 
distribution. 

We now outline the proof that Shor's trade-off curve 
for entanglement-assisted classical communication single- 
letterizes for the qubit dephasing channel (full details appear 
in Ref. [34 J — the argument complements the argument in Ap- 
pendix B of Ref. iQ). Any point along Shor's trade-off curve 
achieves a classical communication rate of I{AX;B'"') at an 
entanglement consumption rate of H {A\X) lfT4ll . Therefore, 
to determine a point along the trade-off curve, we would like to 
maximize the classical communication rate while minimizing 
the entanglement consumption rate. To do so, we can define 
the following function 

h (AA^") = max (/ [AX; B") - XH {A\X)) , 

where A > and the maximization is over all states of the form 



(15 1, with the exception that the E' system is not necessary 
for Shor's trade-off curve [14|. By a sequence of arguments 
similar to those in Appendix B of Ref. Q, we can show that 



fx (AA^") < nh^ (AA) 



where 



hx (A/") = max {H (Y) + {1 - X) H {A\X) - H {E\X)) , 

Y is the completely dephased version of B, and cr^ is a state 
that arises after sending the A' system of a state of the form 



in (69 1 through a single use of the qubit dephasing channel. 



This then shows that the region single-letterizes and that states 



of the form in ( 69 1 give rise to optimal points that lie along 



Shor's trade-off curve. Shor's trade-off curve in the CE-plane 
has the following form: 



{{Cce{i^),0,Ece (m)) : < M < 1/2} 



(72) 



where 



CCE (m) 

EcE (m) 



1 + H2 
H2 (m) . 



H2 {9 



We can now exploit Shor's trade-off curve to outline a 
bounding surface in the CQE space (just as we did before 
with the Devetak-Shor curve and entanglement distribution). 
Consider the surface formed by the following set of points: 

{{CcE (/i) - 2S, E, EcE {p)-E):0<^i< 1/2, E > 0} . 

(73) 

This surface is an outer bound for the capacity region (if it 
were not so, one could combine points outside this surface 
with super-dense coding and beat the optimal bound in ([72|). 
Interestingly, this surface intersects the surface in (71\ at 
exactly the classically-enhanced father trade-off curve. 

We can finally outline the full capacity region by combining 
the two surfaces in ( |7T| and ( |73] l with the bound: 



C + 2Q < 2 - i/2 (5 {p, 1/2)) , 



(74) 



The above bound is the largest that the entanglement-assisted 
classical capacity can be and therefore bounds the sum rate 
C + 2Q as we have argued previously. The intersection of 
these three surfaces forms a single-letter bound for the capacity 



18 



01 



"D 

6' 



Qu, 




region. For the third time-sharing strategy, time-sharing is the 
optimal strategy. If it were not (in the sense that one could 
achieve a higher quantum or classical rate than a point along 
the time-sharing bound), then one could beat the bound in 



1 0.5 
Classical communication rate 



Fig. 4. (Color online) The above figure plots the full capacity region for the 
qubit dephasing channel with dephasing parameter p = 0.2. It outlines Shor's 
trade-off curve, the Devetak-Shor (DS) trade-off curve, and the classically- 
enhanced father (CEF) trade-off curve. The surface between Shor's curve and 
the CEF curve is that in j73|. The surface between the CEF curve and the DS 
curve is that in {71) . Finally, {14) specifies the solid plane. This region is a 
union of regions formed by translating the unit resource capacity region from 
Ref. 1261 along the classically-enhanced father trade-off curve. This point is 
perhaps more clear in Ref. L26J where we plot the full triple trade-off. 



region, and all points on the boundary are achievable by 
combining the classically-enhanced father trade-off curve with 
entanglement distribution, super-dense coding, or the wasting 
of entanglement. Figure |4] plots the full capacity region. 

2) The Classically-Enhanced Father Protocol can beat 
Time-Sharing: An important question for entanglement- 
assisted classical-quantum coding is whether a time-sharing 
strategy is optimal for all channels or if the classically- 
enhanced father protocol can give an improvement over time- 
sharing. There are three time-sharing strategies that one could 
employ in EACQ coding. In all three strategies, we suppose 
that the sender and receiver share some finite amount of 
entanglement E. The three strategies are as follows: 

1) Use an entanglement-assisted quantum code with rate 
triple (0, and an HSW code with rate triple 
(C2,0, 0). If E — XEi, then time-sharing produces an 
EACQ code with rate triple ((1 - A) C2,XQi,E). 

2) Use an entanglement-assisted classical code with rate 
triple {C 1,0, El) and a quantum channel code with rate 
triple (0, Q2, 0). If E = XEi, then time-sharing produces 
an EACQ code with rate triple (ACi, (1 - A) Q2,E). 

3) Use an entanglement-assisted quantum code with rate 
triple (0,Qi,£'i) and an entanglement-assisted classi- 
cal code with rate triple ((^2,0, i?2)- If i? = XEi + 
(1 — X) E2, then time-sharing produces an EACQ code 
with rate triple ((1 - A) C2,XQi,E). 

We should compare the classically-enhanced father protocol 
to the first time-sharing strategy because the two points EAQ 
and HSW are special cases of it. For the second time-sharing 
strategy, it is clear that this strategy is not optimal because 
the line connecting EAC to LSD is strictly inside the capacity 



(12 1 by combining this protocol with either teleportation or 
super-dense coding. 

We now consider the first case for the qubit dephasing 
channel and show that the classically-enhanced father protocol 
can beat a time-sharing strategy. Consider the qubit dephasing 
channel with dephasing parameter p = 0.2. The classical 
capacity of this channel is one bit per channel use, and the 
entanglement-assisted quantum capacity is about 0.7655 qubits 
per channel use while using about 0.2345 ebits per channel 
use. The solid red line in Figure |5j a) corresponds to the time- 
sharing line between these two optimal points. The blue dotted 
line in Figure [5ja) corresponds to the various points along the 
classically-enhanced father protocol. In comparing the time- 
sharing line to the classically-enhanced father trade-off curve, 
we see that the classically-enhanced father protocol achieves 
more quantum communication for less entanglement consump- 
tion for any point along the time-sharing line that achieves the 
same amount of classical communication. Figure |5jb) makes 
this statement precise by comparing the difference in quantum 
communication and entanglement consumption for all points 
along the trade-off curve that achieve the same amount of 
classical communication as a time-sharing point. 



VIII. Conclusion 

We have proven the entanglement-assisted classical and 
quantum capacity theorem. This theorem determines the ul- 
timate rates at which a noisy quantum channel can com- 
municate both classical and quantum information reliably, 
while consuming entanglement to do so. The coding strat- 
egy exploits a new entanglement-assisted classical-quantum 
coding strategy, the classically-enhanced father protocol, and 
the unit protocols of teleportation, super-dense coding, and 
entanglement distribution. Several protocols in the family tree 
of quantum Shannon theory are now child protocols of the 
classically-enhanced father. We also have provided example 
channels whose corresponding CQE capacity regions single- 
letterize, so that we can actually determine the region for these 
channels, and we have shown that classically-enhanced father 
protocol beats a time-sharing strategy for the case of a qubit 
dephasing channel. We discuss follow-up work and several 
open problems in what follows. 



A. The Full Triple Trade-off 

The present article addresses only one octant of the channel 
coding scenario — the octant where we consume entanglement 
and generate classical and quantum communication. We char- 
acterize the full triple trade-off region in Ref. |26|, where we 
show that the classically-enhanced father protocol combined 
with the unit resource protocols in ( 6T||63 1 achieves the full 
capacity region for all octants. 
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Extra quantum communication 




(a) (b) 

Fig. 5. (Color online) (a) The figure on the left displays the points achievable by time-sharing between entanglement-assisted quantum coding and classical 
coding on the solid red line, and it displays the points achievable with the classically-enhanced father protocol on the dotted blue line. The channel for which 
we are coding is the qubit dephasing channel with dephasing parameter p = 0.2. The figure demonstrates that one can achieve more quantum communication 
with less entanglement consumption, while having the same rate of classical communication, by employing the classically-enhanced father protocol instead 
of a time-sharing strategy, (b) The figure on the right makes the previous statement precise, by showing the difference between quantum communication and 
entanglement consumption for achievable points on the classically-enhanced father trade-off curve that attain the same rate of classical communication as a 
time- sharing strategy. 



B. The Structure of Classically-Enhanced Father Codes 

In Ref. EOl . one of the authors constructed a classically- 
enhanced father code that uses only ancilla qubits for encoding 
classical information. In Ref. |fT9l . the other author constructed 
a classically-enhanced father code that uses both ancilla qubits 
and ebits for encoding classical information. One might think 
that using ebits in addition to ancilla qubits for encoding 
classical information could improve performance and it was 
unclear which coding structure might perform better 

The structure of our classically-enhanced father protocol 
actually gives a hint for constructing classically-enhanced 
father codes that achieve the rates in Theorem [T] Consider 
the protocol in the proof of the direct coding part of Theo- 
rem [T] Bob decodes the classical information by measuring 
the channel outputs only. He does not need to measure his 
half of the entanglement to decode the classical information. 
This decoding implies that he is not using the entanglement for 
sending classical information — if he were, he would need to 
measure his half of the entanglement as well. This observation 
lends creedence to the conjecture that it is sufficient to encode 
classical information into ancilla qubits when attempting to 
construct codes that achieve the trade-off rate triple in Theo- 
rem [l] 

C. Other Issues 

Another issue remains with the "pasting" proof technique. It 
relies on the assumption that the channel is IID and thus does 
not apply in a straightforward way to channels with memory. 
Many proof techniques in quantum Shannon theory rely on a 
"one-shot" lemma applied to the IID case. The usefulness of 



this method of proof is that the one-shot result can apply to 
more general scenarios such as channels that have memory. 
So it may be useful to develop a one-shot result for the code 
pasting technique. 
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Appendix A 
Proof of Proposition |2] 

The proof of Proposition [2] is an extension of the develop- 
ment in Appendix D of Ref. [6|. 

Proposition^ Consider an arbitrary density operator 
whose spectral decomposition is as follows: 

xex 

The n*'' extension of the above state as a tensor power state 
is as follows: 
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We define the pruned distribution p'" as follows: 







X" (x") : a;' 



else, 



where T/*^ denotes the (5-typical set of sequences with length 
n. Let " denote the following "pruned state": 



lA"* 



(75) 



a;" ST/ 



4 fn 

For any e > and sufficiently large n, the state p is close 
to by the gentle measurement lemma ll35l and the typical 
subspace theorem [8|: 



P 



P 



'A'" 



< 2e. 



For any density operator p^ , it is possible to construct an 
entanglement-assisted quantum code that achieves the quan- 
tum communication rate and entanglement consumption rate 
in Proposition |2] Ref. IITSl provides group-theoretical and 
other clever arguments to show how to achieve the rates 
in Proposition |2] Another method for achieving the rates in 
Proposition |2] is to exploit the connection between quantum 
privacy and quantum coherence in constructing quantum codes 
lISl, ll36l . Indeed, in Ref. Il37l . one of the current authors 
showed how to construct secret-key-assisted private classical 
codes for a quantum channel. Using the methods of |6|, |36J, 
it is possible to make "coherent" versions, i.e., entanglement- 
assisted quantum codes, of these secret-key-assisted private 
classical codes. Let [k] denote a set of size ^ 2"'^ and let [m] 
denote a set of size - 2"^. Let Uk,m denote - 2"('3+^) 
random variables that we choose according to the pruned 
distribution p'" (x"). The realizations u^.m of the random 
variables Uk.m are sequences in A"" and are the basis for 
constructing an entanglement-assisted quantum code C whose 
codewords are as follows 



The entanglement-assisted quantum codewords \4>k)'^ in C 
are as follows 



10* 



,A"Tb _ 



where 



We then expurgate this code to improve its performance and 
this expurgation has a minimal impact on the rate of the 
code. After expurgation, the code forms a good entanglement- 
assisted quantum code, resulting in failure with probability 
e + lOv^ by the arguments in Refs. ID, ll36l . Suppose that 
we choose a particular entanglement-assisted quantum code C 
according to the above prescription. Its code density operator 
is 



1 



2"« 
k=l 



and its input code density operator is 



/'"(C) 



Tr7 



1 



2n{Q+E) 



\A'" 



m— 1 k — 1 



Suppose we now consider the entanglement-assisted code 
chosen according to the above prescription as a random code 
C (where C is now a random variable). Let p'^ " (C) be the 
channel input density operator for the random code before 
expurgation and p"^ (C) its channel input density operator 
after expurgation: 



JA'" 



2 niQ'+E') Y J2 l<^t/..™)(<^t 



I A'" 



(C) ^ 



1 

2"(Q+s) 



fe=l m=l 

2^0 2"-B 



I A'" 



k—1 m—1 



where the primed rates are the rates before expurgation and the 
unprimed rates are those after expurgation (they are slightly 
different but identical for large n). Let p'^ and p"^ denote 
the expectation of the above channel input density operators: 



P^'" = Ec 



{p^'"iC)}. 



Choosing our code in the particular way that we did leads to an 
interesting consequence. The expectation of the density oper- 
ator corresponding to Alice's restricted codeword \4>Uk m)'^ " 
is equal to the pruned state in ( [75| ): 



|A'" 



because we choose the codewords | (pu^ ^ ) randomly according 
to the pruned distribution p'" (x"). Then the expected channel 
input density operator p'"^ is as follows: 



-/A'" 



= E, 



2niQ' + E') E T. ^c{\(bL 



k—1 m—1 
\ I ± |A'" 



(76) 

J^'"} (77) 



(78) 



Then we know that the following inequality holds for p' 
and the tensor power state p-^ 



-lA'" 



P 



A"* 



< 2e 



(79) 



by the typical subspace theorem and the gentle measurement 
lemma. The expurgation of any entanglement-assisted code C 
has a minimal effect on the resulting channel input density 
operator |6|: 



<4a 
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The above inequality implies that the following one holds for 



the expected channel input density operators p 



and 

(80) 



because the trace distance is convex. The following inequality 
holds 



< 2e 



(81) 



by applying the triangle inequality to ( [79| ) and ( [80| . Therefore, 
the random entanglement-assisted quantum code is p-like. ■ 

Appendix B 
Proof of Proposition[3] 

We now prove Proposition [3] that applies to a random father 
code that has an associated classical string. 

Proposition [ij The proof is similar to the proof of 
Proposition 5 in Ref. Q. Suppose that we have an en- 
semble {px,Px} where each density operator has a 
purification ip^^ and state (j)^^^ = U^'^^^{ip^^ ) arising 
from the channel J\f"^ . By Proposition [2] for sufficiently 
large n and for all a; G A", there exists a random -Uke 
entanglement-assisted {n[px — (5],e) code of quantum rate 
Qx = I{A; B)^_^/2 — S and entanglement consumption rate 
Ex — I {A; E)i /2 + 5. Its expected channel input density 



operator 



is close to a tensor power of the state p^ 



A' . 



rx 



■S] 



< €. 



The code's quantum rate is Qx = \l{A\ B)^^ — 6 because 
it transmits n [px — S] Qx qubits for n [px — S] uses of the 
channel. The code's entanglement consumption rate is Ex = 
|/(v4; E)if,^ + S because it consumes at least n [p^ — S] E^ 
ebits for n [px ~ S] uses of the channel. We produce an 
{n — \X\6,\X\e) entanglement-assisted code with expected 
channel input density operator 



)Px 



by "pasting" jA"! of these codes together (one for each x). Ap- 
plying the triangle inequality \X\ times, the expected channel 
input density operator p"^ * ' ' ' of the pasted code is close 



to a pasting of the tensor power states {px 



5n[px — 5] 



< \X\e. 



Consider the classical sequence x". Let random variable X 
have the probability distribution p and define the typical set 



X" 



{x" : Vx \nx - npx\ < Sn}, 



where = N{x\x") is the number of occurrences of the 
symbol x in x". If x" lies in the typical set , then we can 
construct a conditional permutation operation that permutes 
the elements of the input sequence as follows 



X Xi ■ ■ ■ Xi X2 ■ ■ ■ X2 ■ ■ ■ X\x\ ■ ■ ■ 



where Xg (for "x garbage") denotes the remaining njA'IJ 
symbols in .t". The density operator px^ corresponds to the 
input sequence x" . We can construct a conditional permutation 
unitary that acts on the density operator px^ and changes the 
ordering of the state px^ as follows: 



Px" 



where dim(^pxg) < n\X \ 6\ogdA' ■ We modify the random 
entanglement-assisted code slightly by inserting \X\d "garbage 
states" with density operator px^ and define the expected 
channel input density operator p^ for the full code as 
follows: 



P 



Then the expected channel input density operator p^ is close 
to the permuted version of px^ 



A'" 



\p'^P--'^®Px, 



< \X\e. 



The quantum rate Q for the random "pasted" father code is 
as follows: 

J2x "^Qx [px - <5] 



Q 



^ Qx [Px - S] 

X 

(I{A-B), 



Px 



\ 2 

X ^ 

I{A;B\X 



-5]-5Qx 



c'S, 



where 



c' = 1 + 



The entanglement consumption rate E is as follows: 

J2x nE^[p.^ - 5] 



E = 



Ex [px - s] 



V 2 
I{A;E\X) 



5Ex 



C 0, 



(82) where 



c"^1 + J2Ex. 



A permutation relates the states px^ and {^p"'^^ "^^ (8) Pxg- 

X 

Therefore, there exists an (n, \X\e) random entanglement- 
asissted code of the same quantum communication rate and 
entanglement consumption rate with an expected channel input 
density operator p'^ that is close to the tensor power state 

Px 

1 

because the action of the IID channel A/^"^" is invariant under 
permutations of the input Hilbert spaces. ■ 
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Appendix C 
Gentle Measurement for Ensembles 

Lemma 1 (Gentle Measurement for Ensembles): Let 
{Px^ Pa:} be an ensemble with average p = "YlxP^P^- Given a 
positive operator X with X < I and Tr{pX} > 1 — e where 
e < 1, then 

^Vx\\Px-^/Xpx^/^ <\/8e. 
II 1 

X 

Proof: We can apply the same steps in the proof of 
the gentle measurement lemma [39l to get the following 
inequality: 

- ^/Xp,^/x\^^ < 8 (1 - Tr{p,X}) . 
Summing over both sides produces the following inequality: 
J^Px - < 8 (1 - Tr{pX}) 

X 

< 8e. 

Taking the square root of the above inequality gives the 
following one: 

Concavity of the square root implies then implies the result: 



Px - 4xpx4x 



< VSe. 



Appendix D 

Entanglement Consumption Rate of the EAC 
Classical Capacity 

We prove that the entanglement consumption rate corre- 
sponding to the maximal EAC rate is one ebit. Consider 
a general qubit density operator that Alice can input 
to the erasure channel. Let ip"^^ denote the purification of 
p^ . Suppose that p has the spectral decomposition p = 
p\<Po){4'o\ + (1 — p)\4'i) {(l>i\ for some orthonormal states 
\(f>()), After Alice transmits this density operator through 
an erasure channel with erasure parameter e, Bob has the 
following state: 



and Eve has 



(l-6)p + e|e)(e|, 



ep+(l-e)|e)(e|, 



where |e) is an erasure state. The entropies H{A), H{B), and 
H{E) are as follows: 

H{A) = H2{p), 

H{B) = {l-e)H2{p)+H2{e), 
H{E)^eH2{p) + H.2{e), 

and the mutual information I{A\ B) is as follows: 

I{A- B) = H{A) + H{B) - H{E) = 2(1 - e)H2{p). 



This quantity is maximized only when p = ^, implying that 
the entanglement consumed for this state is exactly one ebit 
because H{A) = i?2(p)- Thus, Alice and Bob cannot consume 
entanglement at a lower rate than this amount in order to 
achieve the EAC capacity. 

Appendix E 

Isometric Encodings suffice in the CQE Theorem 

We prove that it is only necessary to consider isometric en- 
codings for achieving points in the CQE capacity region. Our 
argument follows the technique of Ref. (14], by showing that 
a protocol can only improve upon measuring the environment 
of a non-isometric encoder. 

Suppose that we exploit the following state that results from 
a non-isometric encoder, rather than the state in ( [T5] l: 

¥BEfj.AA'E' 



). (83) 



The inequalities in ( [T2p4| for the CQE capacity region involve 
the mutual information I{AX; B)^, the Holevo information 
I{X;B)s, and the coherent information I{A)BX)'g. As we 
show below, each of these entropic quantities can only improve 
if Alice measures the system E'. This improvement then im- 
plies that it is only necessary to consider isometric encodings 
in the CQE capacity theorem. 

Suppose that Alice sends the system E' through a com- 
pletely dephasing channel to obtain a classical vari- 
able Y (this simulates a measurement). Let ^^yabe (jgjjQj-g 
this later state, a state of the form: 



-XYABE _ 



\x) 



(84) 

This state is actually a state of the form in ( 15 1 if we subsume 



the classical variables X and Y into one classical variable. 

The following three inequalities each follow from an appli- 
cation of the quantum data processing inequality (or, equiva- 
lently, strong subadditivity): 

I{X;B)^=I{X;B)^<I{XY;B)^, (85) 
I{AX■,B)^=I{AX■,B)^<I(AXY;B)^ (86) 
I{A)BX)^ = I{A)BX)^ < I{A)BXY)^ (87) 

Each of these inequalities proves the desired result for the re- 
spective Holevo information, mutual information, and coherent 
information. 
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